Position Paper: Dataset profling for un-Linked Data
نویسندگان
چکیده
The vast amount of data on the web presents a growing need to advance data search. Rich and meaningful metadata can enhance the discovery of datasets and establish connections between them. Where metadata is not comprehensive, it can be expanded through dataset profiling. The relative importance of different types of profiles varies depending on the user’s context and the objective of the task. We discuss an approach to find un-Linked datasets and increase result relevance by offering related information. We propose generating rich profiles for datasets; counting the number and strength of relations between them and showing a graph of profiles that represents connections between different datasets. We can thereby capture correlations between datasets that can then improve the efficiency and effectiveness of data search. If developed further this would improve discoverability and reusability of datasets.
منابع مشابه
An improved opposition-based Crow Search Algorithm for Data Clustering
Data clustering is an ideal way of working with a huge amount of data and looking for a structure in the dataset. In other words, clustering is the classification of the same data; the similarity among the data in a cluster is maximum and the similarity among the data in the different clusters is minimal. The innovation of this paper is a clustering method based on the Crow Search Algorithm (CS...
متن کاملAnalysis of site frequency spectra from Arabidopsis with context-dependent corrections for ancestral misinference.
Previous studies have shown that the pattern of single nucleotide polymorphism (SNP) in Arabidopsis (Arabidopsis thaliana) deviates from the distribution expected under a neutral model. Here, we test whether or not ancestral misinference could explain this deviation. We start by showing that there are significant and complex influences of context on mutation dynamics as inferred from SNP freque...
متن کاملData preparation techniques for a perinatal psychiatric study based on linked data
BACKGROUND In recent years there has been an increase in the use of population-based linked data. However, there is little literature that describes the method of linked data preparation. This paper describes the method for merging data, calculating the statistical variable (SV), recoding psychiatric diagnoses and summarizing hospital admissions for a perinatal psychiatric study. METHODS The ...
متن کاملA Linked Dataset of medical educational resources
Reusable educational resources became increasingly important for enhancing learning and teaching experiences, particularly in the medical domain where resources are particularly expensive to produce. With respect to this, research has aimed at improving interoperability across educational resources metadata repositories, which led to a fragmented landscape of competing metadata schemas, such as...
متن کاملLinked Web APIs Dataset Web APIs meet Linked Data
Web APIs enjoy significant increase in popularity and usage in the last decade. They have became the core technology for exposing functionalities and data. Nevertheless, due to the lack of semantic Web API descriptions their discovery, sharing, integration, and assessment of their quality and consumption is limited. In this paper, we present the Linked Web APIs dataset, an RDF dataset with sema...
متن کامل